Abstract


The aim of this research is to deliver climatic insights to clients from various
industries, such as agriculturists, scholars, and others, so that they may grasp the
causes of changes in climatic and atmospheric features like precipitation,
humidity and temperature. One of the most essential areas of meteorological
science is estimating precipitation. A combination of factual processes and ML
1. Introduction


Rainfall forecasting helps prevent floods, which saves lives and property. It also helps in managing
the water supply (Namitha et al., 2015). Rainfall data from the past has aided farmers in better
managing their crops, resulting in increased economic growth in the country. Rainfall prediction is
difficult for meteorological scientists because of the variability in rainfall timing and volume.
Precise rainfall forecasting has become one of the most pressing topics in hydrology: early warning of
extreme weather can help prevent natural disasters and the damage they cause, provided forecasts are
made quickly and accurately (Shabib et al., 2018). It is one of the most challenging tasks for academics
in a variety of domains, including meteorological data mining, environmental ML, and others. Early
warning, event frequency prediction for hydraulic design, and long-term projections for developing
adaptation strategies to future conditions all require daily and seasonal forecasts. Droughts and floods
are often treated separately in all three forms of predictions (forecasts, frequency estimates, and
projections) despite the fact that both types of extremes can be examined using similar methodologies
and pose similar issues. There are two basic ways for forecasting rainfall in India. The most often used
computational approaches for weather forecasting are regression, decision trees, fuzzy logic, and group
data handling methods. The main goal is to gather intangible and sometimes expensive knowledge by
following the rules and relationships in the information. ANNs are a potential element of this broad
field (Zainudin et al., 2016; Ridwan et al., 2021; Grace and Suganya, 2020; Thirumalai et al., 2017;
Venkata Ramana et al., 2013; and Sivapragasam et al., 2001).


The paper is structured as follows: the second section defines ML, presents its relationship to AI, and
describes the most often used approaches. It gives a taxonomy of the various approaches to rainfall
prediction that have been proposed. The following part goes over the findings of the study in terms of
the techniques utilized and their evaluation. It identifies the obstacles encountered and suggests ways to
overcome them. The paper ends with the final conclusion. Table 1 summarizes most of the existing
literature, together with the tools and algorithms used for different predictions and detections
(Cramer et al., 2015).


Table 1: Related Studies and Existing Models for Prediction of Rainfall


| Authors | Year of Publication | Summary | Techniques / Tools |
| --- | --- | --- | --- |
| Namitha K et al. | 2015 | Introduces an approach to the classification of a large volume of weather data; an ANN is used to estimate rainfall on other days. | ANN (Artificial Neural Network) |
| Shabib Aftab et al. | | Explains the use of different data mining technologies for prediction of rainfall. | Data mining technologies |
| Suhaila Zainudin et al. | | Data mining methods used to compute prediction of rainfall. | Used various data mining techniques |
| Wanie M. Ridwan et al. | | Explains various machine learning models. | Used various Machine Learning (ML) models |
| R. Kingsy Grace et al. | | Describes a tool using (MLR). | (MLR) |
| Chandrasegar Thirumalai et al. | | Discusses the rate of rainfall in previous years and predicts the rainfall in future. | Machine Learning techniques (linear regression method) |
| Dutta et al. | | Weather forecasting is the application of science and technology. It relies on quantitative information about the present condition of the environment, gathered by meteorological observation posts and interpreted by meteorologists. | Regression |
| Nkrintra Singhrattna et al. | | In the post-1980 period, the ENSO-Indian monsoon link has decreased significantly. They propose that this weakening could be caused by changes in ENSO features and global warming. As with most of the Indian monsoon's predictors, this is having a significant impact on forecasting attempts. | Linear regression, Nonparametric regression, LOCFIT |
| M Kannan et al. | | Group data handling methods are the most extensively used empirical methodologies for climate prediction. The dynamical approach to forecasting uses physical models based on systems of equations that predict how the climate system evolves in response to initial atmospheric conditions. | Regression |
| Aakash Parmar et al. | | Machine learning was used to anticipate the rainfall pattern. The ANN model has a strong ability to learn by adjusting its parameters to achieve the desired result. They indicated that small changes in the number of layers and neurons in the input, without any changes to the output parts, could be used to make the neural network generalize well enough. | Back Propagation NN, Layer Recurrent Network, (SVM), (SOM), Cascade Forward Back Propagation Network |
| David N. Yates et al. | 2003 | Downscaling procedures, which entail generating statistical connections between past meteorological data and output from regional and climate projections, are used to evaluate the expected consequences of climatic changes for IA analysis. | |
| Antonio Sarasa-Cabezuelo | | Because of the enormous number of variables on which they depend, records of meteorological phenomena generate an enormous amount of data, making it harder to forecast future events. Probabilistic models, which provide forecasts with a margin of uncertainty, are commonly utilized. | K-NN Algorithm, Decision/Random Forest Trees, Neural Networks |
| Hung et al. | | Accurate rainfall data is critical for planning and managing water resources. | |
| Jyoti Upadhyay et al. | 2012 | Calculates the impact of climate variables and changes on Assam's rice productivity. The authors use time series data from 1970 to 2010 for the various variables. They conclude that crop yield decreases as temperature increases, and that crop productivity increases if the temperature is positive. | |
Table 1 (Cont.)


| Authors | Year of Publication | Summary | Techniques / Tools |
| --- | --- | --- | --- |
| Utpal Misra et al. | 2018 | The prediction of merged rainfall is a recent trend in which precipitation is estimated using a mix of satellite, rain gauge, and NWP model rainfall. The underlying concern of each component's accuracy is currently being investigated. It is also worth noting that at finer scales, the rainfall product beats model output. | NN |
| Venkata Ramana et al. | 2013 | Predicts rainfall using the wavelet technique with an ANN model on rainfall data from the Darjeeling rain gauge. The analysis shows that the performance of WNN is higher than that of ANN. | The Wavelet and the ANN model |
| Nikhil Oswal | 2019 | Presents the ML life cycle, from data pre-processing to model implementation and evaluation. The data pre-processing step includes procedures such as missing value replacement, feature transformation, categorical feature encoding, feature scaling, and feature selection. | Accuracy, Precision, Recall |
| Sivapragasam Chandrasekaran et al. | 2001 | Proposed a technique combining SSA with SVM on Singapore rainfall data. The authors compare results with the non-linear prediction (NLP) method and conclude that the proposed technique gives higher accuracy than NLP. | (SSA) combined with (SVM) |
| Sam Cramer et al. | 2015 | Predicts rainfall using Genetic Programming (GP). The authors also compare GP with MCRP across 21 data sets of cities in Europe. The paper concludes that the GP method is better than the MCRP method. | MCRP and GP method |
| Minghui Qiu et al. | 2017 | Predicted short-term rainfall with the help of multiple neural network and deep learning techniques. The paper concludes that the proposed approach is better than other models. | Multitask convolutional neural network and deep learning techniques |
| Santosh Kumar Nanda et al. | 2013 | Uses ARIMA and different ANN models, such as MLP, FLANN and LPE, as time series models to predict rainfall. | ARIMA models and MLP, FLANN and LPE models |
| Maya L. Pai et al. | 2014 | Proposed to predict Southwest Indian Monsoon rainfall using ANN with a clustering approach. The authors predicted the rainfall for 6 years, and the model was trained on 36 years. | ANN based on Clustering Approaches |
| Tomoaki Kashiwao et al. | 2017 | Develops an accurate prediction with the help of various ANN models such as MLP and RBFN. The paper concludes that the MLP model is more efficient than the RBFN model for the prediction. | Different ANN models (RBFN) and (MLP) with a hybrid algorithm comprising (BP) and (RO) approaches |
| Kyaw Htike et al. | 2010 | Uses an FTDNN approach to generate rainfall forecasting models for different datasets. Each dataset used by the authors contains 29 years of data. The paper concludes that the yearly rainfall dataset gives better accuracy than the monthly and quarterly datasets. | FTDNN |


From the above literature, various common challenges in rainfall prediction are identified. Furthermore,
potential approaches for tackling these challenges are discussed.


2. Methodology


The artificial intelligence-based methodology for the prediction of rainfall data is illustrated in Figure 1,
which shows a step-by-step procedure for predicting rainfall. Because of the greater processing costs and
memory requirements, high-dimensional data poses a challenge for classification systems. To address this,
dimensionality reduction techniques like feature reduction and feature selection were applied (Pai et al., 2014).


Figure 1: Basic Methodology Used for Rainfall Prediction Using ML and DL.


Feature extraction and selection algorithms are used separately or in combination to increase performance,
estimated accuracy, visualization, and information comprehension. The benefit of employing feature selection
is that the significance of any small piece of information is not overlooked; however, when evaluating a huge
pool of data with many different features, certain features may be overlooked (Htike and Khalifa, 2010).
Feature extraction, by contrast, reduces the size of the feature set. Choosing between these two techniques
depends solely on the type and domain of the specific task.
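
As a rough illustration of feature selection, the sketch below ranks candidate predictors by the absolute value of their Pearson correlation with the target and keeps the top k. The ranking criterion, the column names, and the toy values are all assumptions made for this example; the surveyed papers may use other selection criteria.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear information
    return cov / math.sqrt(var_x * var_y)

def select_features(columns, target, k):
    """Return the names of the k columns most correlated (in absolute value) with target."""
    ranked = sorted(columns, key=lambda name: abs(pearson(columns[name], target)), reverse=True)
    return ranked[:k]

# Hypothetical toy data: three candidate predictors and a rainfall target.
columns = {
    "humidity":    [60, 65, 70, 80, 90],
    "temperature": [30, 29, 27, 25, 22],
    "station_id":  [7, 7, 7, 7, 7],   # constant, so uninformative
}
rainfall = [1.0, 2.0, 4.0, 7.0, 11.0]
selected = select_features(columns, rainfall, k=2)
```

Here the constant `station_id` column is dropped, while the original `humidity` and `temperature` values are kept unchanged, which is what distinguishes selection from extraction.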


The final aim of the prediction models is to estimate future rainfall on test data. This is possible using
simple steps such as importing datasets, extracting the independent and dependent variables, splitting the
dataset into training and testing sets, and feature scaling (Kashiwao et al., 2017).
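
The preparation steps listed above might look like the following minimal sketch. It assumes the last step is feature scaling (standardization to zero mean and unit variance), and the records, the 25% test fraction, and the helper names are hypothetical.

```python
import random

def train_test_split(rows, test_fraction=0.25, seed=0):
    """Shuffle rows reproducibly and split them into training and testing sets."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]

def fit_scaler(values):
    """Compute mean and standard deviation on the training data only."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return mean, (var ** 0.5) or 1.0  # fall back to 1.0 to avoid dividing by zero

def scale(values, mean, std):
    """Standardize values using previously fitted statistics."""
    return [(v - mean) / std for v in values]

# Hypothetical records: (humidity in %, rainfall in mm).
dataset = [(55, 0.0), (60, 1.2), (68, 2.5), (72, 3.1),
           (80, 6.4), (85, 8.0), (90, 10.3), (95, 12.9)]

train, test = train_test_split(dataset)
x_train = [h for h, _ in train]   # independent variable
y_train = [r for _, r in train]   # dependent variable
x_test = [h for h, _ in test]

mean, std = fit_scaler(x_train)
x_train_scaled = scale(x_train, mean, std)
x_test_scaled = scale(x_test, mean, std)  # reuse training statistics on the test set
```

Fitting the scaler on the training set alone keeps information from the test set out of the model, which is the usual reason for splitting before scaling.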


3. Artificial Intelligence, Machine Learning and Deep Learning for Rainfall Prediction 


In its classic definition, Artificial Intelligence (AI) is “weak.” As it becomes stronger, it becomes capable
of recreating human behavior with awareness, sensitivity, and spirit. The advent of Machine Learning (ML)
provided the means to take a step toward realizing this goal. It works on the premise of recreating a behavior
using algorithms that are fed enormous amounts of data. When confronted with a variety of scenarios, the
algorithm learns which judgment to make and builds a model. As a result, a computer can automate operations
based on the circumstances (Yates et al., 2003; Sarasa-Cabezuelo, 2022; Hung et al., 2009; and Upadhyay,
2012). In general, machine learning requires a training dataset, a test dataset, and an algorithm to produce
a predictive model, as shown in Figure 2. Each category contains different algorithms. Various algorithms are
used for analyzing weather data to predict rainfall; some of them are discussed as follows.


Figure 2: Classifications of Machine Learning Algorithms for Predicting and Analyzing Weather Data


3.1. Random Forest


Random Forest is a common ML algorithm in the supervised learning family. It is used to tackle ML
classification and regression problems. It is based on ensemble learning, a method for tackling complicated
problems and improving model performance by combining multiple classifiers. “Random Forest is a classifier
that consists of a number of decision trees on different subsets of a given dataset and takes the average to
enhance the predicted accuracy of that dataset”. The random forest predicts the final outcome from the
majority vote of the predictions of each tree, instead of relying on a single decision tree (Dutta and
Tahbilder, 2014). The functioning of the Random Forest algorithm can be understood from Figure 3 and the
following steps.


Figure 3: Illustration of Random Forest Algorithm 


Step 1: Begin by selecting samples of data from the acquired dataset.

Step 2: For each data sample, the algorithm builds a decision tree. A forecast result is then obtained
from every decision tree.

Step 3: Voting on the predicted results is held at this stage.

Step 4: As the forecast's final result, choose the outcome of the most popular prediction. Random Forest
regression from sklearn is utilized for the training model.
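
The four steps can be sketched in plain Python as follows. This is an illustrative stand-in, not the scikit-learn Random Forest regressor mentioned above: each "tree" is a one-split decision stump on a randomly chosen feature, the task is classification by majority vote, and the data, labels, and function names are hypothetical.

```python
import random
from collections import Counter

def train_stump(sample, feature):
    """Fit a one-split decision tree (a stump) on one feature of a sample."""
    best = None
    for threshold in sorted({row[feature] for row, _ in sample}):
        left = [label for row, label in sample if row[feature] <= threshold]
        right = [label for row, label in sample if row[feature] > threshold]
        left_label = Counter(left).most_common(1)[0][0]
        right_label = Counter(right).most_common(1)[0][0] if right else left_label
        errors = sum(l != left_label for l in left) + sum(l != right_label for l in right)
        if best is None or errors < best[0]:
            best = (errors, threshold, left_label, right_label)
    _, threshold, left_label, right_label = best
    return lambda row: left_label if row[feature] <= threshold else right_label

def train_forest(data, n_trees=25, seed=0):
    """Build n_trees stumps, each on its own bootstrap sample."""
    rng = random.Random(seed)
    n_features = len(data[0][0])
    forest = []
    for _ in range(n_trees):
        sample = [rng.choice(data) for _ in data]     # Step 1: draw a bootstrap sample
        feature = rng.randrange(n_features)           # random feature for this tree
        forest.append(train_stump(sample, feature))   # Step 2: one tree per sample
    return forest

def predict(forest, row):
    """Collect one vote per tree and return the most popular label."""
    votes = Counter(tree(row) for tree in forest)     # Step 3: voting
    return votes.most_common(1)[0][0]                 # Step 4: most popular prediction

# Hypothetical samples: (humidity %, temperature in degrees C) -> label.
data = [((90, 22), "rain"), ((85, 24), "rain"), ((88, 21), "rain"), ((92, 23), "rain"),
        ((40, 33), "dry"), ((45, 35), "dry"), ((50, 31), "dry"), ((38, 34), "dry")]
forest = train_forest(data)
```

For regression, as in the sklearn regressor the text refers to, the vote in `predict` would be replaced by the average of the trees' numeric outputs.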


3.2. Linear Regression Method


Linear regression can be treated as one of the most conventional machine learning techniques. In the LR
model, the dataset produced by the pre-processing stage is used as the input (Htike and Khalifa, 2010). The
feature is obtained using the LR model, as shown in Figure 4.
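
A simple linear regression of this kind can be fitted with ordinary least squares, as in the sketch below. The single predictor (humidity), the toy values, and the function name are assumptions chosen for illustration.

```python
def fit_simple_linear_regression(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical pre-processed data: humidity (%) versus rainfall (mm).
humidity = [60, 70, 80, 90]
rainfall = [2.0, 4.0, 6.0, 8.0]
slope, intercept = fit_simple_linear_regression(humidity, rainfall)
predicted = slope * 85 + intercept   # estimated rainfall at 85% humidity
```

Because the toy points lie exactly on a line, the fit recovers a slope of 0.2 mm per percentage point of humidity; real rainfall data would leave a residual error.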


3.3. Artificial Neural Network 


As shown in Figure 5, an ANN is defined by a neuron model (the NN's information processing unit) and an
architecture (a set of neurons and the links connecting them). Each link carries a weight; the NN is trained
by altering these weights so that it accurately represents a specific learning task on the training samples.


(1) Back-Propagation Neural Network: The BPNN is a multilayer feed-forward (MLFF) NN with one input layer,
hidden layers, and one output layer. A basic BPNN design has one hidden layer. By modifying the weights after
each iteration, the BPNN aims to reduce the error, that is, the discrepancy between the computed output and
the desired output of the neural network (Sivapragasam et al., 2001; and Cramer et al., 2015).
Figure 4: Simple Linear Regression Model


Figure 5: Basic Structure of ANN for Predicting Values (input layer of predictor features, hidden layer, output layer)


(2) Cascade Forward Back Propagation: CFBP is a type of ANN used to predict new output data. Each layer is
connected to its previous layer and to the input layer. Each layer in the network is provided with different
inputs (Pai et al., 2014).


(3) Layer Recurrent Network: In a layer recurrent network, a directed cycle is created by the connections
between units. Unlike a feed-forward neural network, it uses internal memory to process arbitrary sequences
of inputs. It is a feedback-loop neural network: the outputs of the hidden layers are delivered back to the
network as input for the next step (Kashiwao et al., 2017).
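
The weight adjustment described for the back-propagation network in item (1) can be sketched as follows. This is a minimal example under stated assumptions: one input, one hidden layer of three tanh units, a linear output, squared error, and per-sample updates. The learning rate, epoch count, and training data are hypothetical.

```python
import math
import random

def train_bpnn(samples, n_hidden=3, lr=0.1, epochs=500, seed=0):
    """Train a 1-input, one-hidden-layer, 1-output network with back-propagation.

    Returns the mean squared error measured during each epoch.
    """
    rng = random.Random(seed)
    w1 = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]  # input -> hidden weights
    b1 = [0.0] * n_hidden
    w2 = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]  # hidden -> output weights
    b2 = 0.0
    history = []
    for _ in range(epochs):
        total = 0.0
        for x, target in samples:
            # Forward pass: compute the network output for this sample.
            hidden = [math.tanh(w1[j] * x + b1[j]) for j in range(n_hidden)]
            output = sum(w2[j] * hidden[j] for j in range(n_hidden)) + b2
            error = output - target
            total += error ** 2
            # Backward pass: propagate the error and adjust every weight.
            for j in range(n_hidden):
                grad_hidden = error * w2[j] * (1 - hidden[j] ** 2)
                w2[j] -= lr * error * hidden[j]
                w1[j] -= lr * grad_hidden * x
                b1[j] -= lr * grad_hidden
            b2 -= lr * error
        history.append(total / len(samples))
    return history

# Hypothetical normalized training pairs (input, desired output).
samples = [(x / 4, (x / 4) ** 2) for x in range(5)]
history = train_bpnn(samples)
```

Comparing `history[0]` with `history[-1]` shows how far the discrepancy between computed and desired outputs has been reduced over the iterations.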


3.4. Support Vector Machine


The SVM's aim is to find a hyperplane in an N-dimensional space that separates the data points, as
depicted in Figure 6. Many possible hyperplanes could be chosen to divide the two groups of data points. The
aim is to find the plane with the largest margin, that is, the greatest distance to the data points of both
classes (Kashiwao et al., 2017). Maximizing the margin provides some reinforcement, so that future data
points can be classified more easily.
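
To make the margin idea concrete, the sketch below computes the geometric margin of a few candidate hyperplanes and keeps the one with the largest value. It only compares hand-picked candidates, whereas an actual SVM solver optimizes over all hyperplanes; the points, labels, and candidates are hypothetical.

```python
import math

def geometric_margin(w, b, points):
    """Smallest signed distance from any labeled point to the hyperplane w.x + b = 0.

    A negative result means at least one point lies on the wrong side.
    """
    norm = math.sqrt(sum(wi * wi for wi in w))
    return min(label * (sum(wi * xi for wi, xi in zip(w, x)) + b) / norm
               for x, label in points)

# Hypothetical 2-D points with labels +1 (rain) and -1 (no rain).
points = [((1.0, 1.0), -1), ((2.0, 1.0), -1), ((4.0, 3.0), +1), ((5.0, 4.0), +1)]

# Candidate hyperplanes given as (w, b).
candidates = {
    "A": ((1.0, 0.0), -3.0),   # vertical line x1 = 3
    "B": ((1.0, 1.0), -5.0),   # diagonal line x1 + x2 = 5
    "C": ((0.0, 1.0), -1.5),   # horizontal line x2 = 1.5
}
margins = {name: geometric_margin(w, b, points) for name, (w, b) in candidates.items()}
best = max(margins, key=margins.get)
```

All three candidates separate the two classes, but the diagonal line leaves the widest gap to the nearest points, so it is the one a maximum-margin criterion would prefer among them.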


3.5. ARIMA Model 


It is used to forecast future output based on previous values. It is a blend of three operators
(Wu et al., 2013): autoregression, moving average, and integration, as shown in Figure 7.
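
A minimal sketch of two of these operators is shown below: integration (first differencing) followed by an autoregressive term fitted by least squares, which corresponds to an ARIMA(1, 1, 0) model. The moving-average operator is left out for brevity, and the series values and function names are hypothetical.

```python
def difference(series):
    """Integration step (d = 1): replace values with first differences."""
    return [b - a for a, b in zip(series, series[1:])]

def fit_ar1(values):
    """Least-squares estimate of phi in v[t] = phi * v[t-1] + noise (no intercept)."""
    num = sum(prev * cur for prev, cur in zip(values, values[1:]))
    den = sum(prev * prev for prev in values[:-1])
    return num / den

def forecast_next(series):
    """One-step forecast from an ARIMA(1, 1, 0) model, i.e. AR(1) on the differences."""
    diffs = difference(series)
    phi = fit_ar1(diffs)
    return series[-1] + phi * diffs[-1]

# Hypothetical monthly rainfall totals (mm).
rainfall = [100.0, 110.0, 115.0, 117.5, 118.75]
next_value = forecast_next(rainfall)
```

In this toy series each change is half the previous one, so the fitted coefficient is 0.5 and the forecast adds half of the last change to the last observation.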


Figure 7: Working Flow of ARIMA Model


In this work, various common algorithms for rainfall prediction are identified and explained. Various
issues can affect the solution of this problem, such as limited data from the meteorology department or
differing weather conditions (Nanda et al., 2013; Dabhi and Chaudhary, 2014; Banerjee et al., 1978; DelSole
and Shukla, 2002; Ganguly and Bras, 2003; Kawasaki and Herath 2011). Here, the author also explains the
various NN and ML based methods for the detection and accurate prediction of rainfall data.


4. Conclusion 


Rainfall estimation is critical for water resource management and the environment. It can be hampered by
erroneous or insufficient information. Estimation is also challenging because rainfall is affected by changes
in geographical and regional features, among many other factors. This study provided an overview of various
ways of rainfall forecasting and the issues one can come across when different forecasting methods are used
(Anctil et al., 2004). Because rainfall data exhibit nonlinear correlations, and because of its learning
agility, the artificial neural network is a better alternative than the other approaches among the options
considered. According to the survey, most researchers employed back propagation networks to predict rainfall
and obtained substantial results. The survey also concludes that (MLP), (SOM), and (SVM) forecasting
approaches are more suitable for predicting rainfall than other forecasting techniques such as numerical and
statistical techniques.